Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 4998 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 224.0 B |
Variable types
| Categorical | 3 |
|---|---|
| Text | 9 |
| Numeric | 15 |
color is highly imbalanced (75.1%) | Imbalance |
language is highly imbalanced (88.8%) | Imbalance |
budget is highly skewed (γ1 = 50.46992572) | Skewed |
director_facebook_likes has 897 (17.9%) zeros | Zeros |
actor_3_facebook_likes has 89 (1.8%) zeros | Zeros |
actor_2_facebook_likes has 55 (1.1%) zeros | Zeros |
movie_facebook_likes has 2162 (43.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-17 00:42:15.818007 |
|---|---|
| Analysis finished | 2024-03-17 00:43:02.917350 |
| Duration | 47.1 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
color
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| Color | |
|---|---|
| Black and White | 207 |
Length
| Max length | 16 |
|---|---|
| Median length | 5 |
| Mean length | 5.4555822 |
| Min length | 5 |
Characters and Unicode
| Total characters | 27267 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Color |
|---|---|
| 2nd row | Color |
| 3rd row | Color |
| 4th row | Color |
| 5th row | Color |
Common Values
| Value | Count | Frequency (%) |
| Color | 4791 | |
| Black and White | 207 | 4.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| color | 4791 | |
| black | 207 | 3.8% |
| and | 207 | 3.8% |
| white | 207 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 9582 | |
| l | 4998 | |
| C | 4791 | |
| r | 4791 | |
| 621 | 2.3% | |
| a | 414 | 1.5% |
| B | 207 | 0.8% |
| c | 207 | 0.8% |
| k | 207 | 0.8% |
| n | 207 | 0.8% |
| Other values (6) | 1242 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21441 | |
| Uppercase Letter | 5205 | 19.1% |
| Space Separator | 621 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9582 | |
| l | 4998 | |
| r | 4791 | |
| a | 414 | 1.9% |
| c | 207 | 1.0% |
| k | 207 | 1.0% |
| n | 207 | 1.0% |
| d | 207 | 1.0% |
| h | 207 | 1.0% |
| i | 207 | 1.0% |
| Other values (2) | 414 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4791 | |
| B | 207 | 4.0% |
| W | 207 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26646 | |
| Common | 621 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 9582 | |
| l | 4998 | |
| C | 4791 | |
| r | 4791 | |
| a | 414 | 1.6% |
| B | 207 | 0.8% |
| c | 207 | 0.8% |
| k | 207 | 0.8% |
| n | 207 | 0.8% |
| d | 207 | 0.8% |
| Other values (5) | 1035 | 3.9% |
Common
| Value | Count | Frequency (%) |
| 621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27267 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 9582 | |
| l | 4998 | |
| C | 4791 | |
| r | 4791 | |
| 621 | 2.3% | |
| a | 414 | 1.5% |
| B | 207 | 0.8% |
| c | 207 | 0.8% |
| k | 207 | 0.8% |
| n | 207 | 0.8% |
| Other values (6) | 1242 | 4.6% |
director_name
Text
| Distinct | 2399 |
|---|---|
| Distinct (%) | 48.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 12.966186 |
| Min length | 3 |
Characters and Unicode
| Total characters | 64805 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1513 ? |
|---|---|
| Unique (%) | 30.3% |
Sample
| 1st row | James Cameron |
|---|---|
| 2nd row | Gore Verbinski |
| 3rd row | Sam Mendes |
| 4th row | Christopher Nolan |
| 5th row | Doug Walker |
| Value | Count | Frequency (%) |
| john | 178 | 1.7% |
| david | 148 | 1.4% |
| michael | 126 | 1.2% |
| unknown | 103 | 1.0% |
| james | 87 | 0.8% |
| robert | 84 | 0.8% |
| peter | 84 | 0.8% |
| richard | 80 | 0.8% |
| paul | 76 | 0.7% |
| scott | 65 | 0.6% |
| Other values (2967) | 9254 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6045 | 9.3% |
| 5287 | 8.2% | |
| a | 5237 | 8.1% |
| n | 4928 | 7.6% |
| r | 4415 | 6.8% |
| o | 3861 | 6.0% |
| i | 3665 | 5.7% |
| l | 2947 | 4.5% |
| t | 2297 | 3.5% |
| s | 2070 | 3.2% |
| Other values (66) | 24053 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48774 | |
| Uppercase Letter | 10398 | 16.0% |
| Space Separator | 5287 | 8.2% |
| Other Punctuation | 259 | 0.4% |
| Dash Punctuation | 87 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6045 | |
| a | 5237 | |
| n | 4928 | |
| r | 4415 | 9.1% |
| o | 3861 | 7.9% |
| i | 3665 | 7.5% |
| l | 2947 | 6.0% |
| t | 2297 | 4.7% |
| s | 2070 | 4.2% |
| h | 1833 | 3.8% |
| Other values (31) | 11476 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 995 | 9.6% |
| J | 916 | 8.8% |
| M | 881 | 8.5% |
| R | 752 | 7.2% |
| C | 704 | 6.8% |
| B | 669 | 6.4% |
| D | 614 | 5.9% |
| A | 564 | 5.4% |
| L | 496 | 4.8% |
| P | 481 | 4.6% |
| Other values (21) | 3326 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 238 | |
| ' | 21 | 8.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5287 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59172 | |
| Common | 5633 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6045 | 10.2% |
| a | 5237 | 8.9% |
| n | 4928 | 8.3% |
| r | 4415 | 7.5% |
| o | 3861 | 6.5% |
| i | 3665 | 6.2% |
| l | 2947 | 5.0% |
| t | 2297 | 3.9% |
| s | 2070 | 3.5% |
| h | 1833 | 3.1% |
| Other values (62) | 21874 |
Common
| Value | Count | Frequency (%) |
| 5287 | ||
| . | 238 | 4.2% |
| - | 87 | 1.5% |
| ' | 21 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64663 | |
| None | 142 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6045 | 9.3% |
| 5287 | 8.2% | |
| a | 5237 | 8.1% |
| n | 4928 | 7.6% |
| r | 4415 | 6.8% |
| o | 3861 | 6.0% |
| i | 3665 | 5.7% |
| l | 2947 | 4.6% |
| t | 2297 | 3.6% |
| s | 2070 | 3.2% |
| Other values (46) | 23911 |
None
| Value | Count | Frequency (%) |
| é | 45 | |
| á | 19 | |
| ö | 16 | 11.3% |
| ó | 16 | 11.3% |
| í | 8 | 5.6% |
| ñ | 7 | 4.9% |
| å | 6 | 4.2% |
| ç | 5 | 3.5% |
| É | 3 | 2.1% |
| Ô | 2 | 1.4% |
| Other values (10) | 15 | 10.6% |
num_critic_for_reviews
Real number (ℝ)
| Distinct | 528 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 139.59704 |
| Minimum | 1 |
|---|---|
| Maximum | 813 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 50 |
| median | 110 |
| Q3 | 193 |
| 95-th percentile | 385.15 |
| Maximum | 813 |
| Range | 812 |
| Interquartile range (IQR) | 143 |
Descriptive statistics
| Standard deviation | 120.9164 |
|---|---|
| Coefficient of variation (CV) | 0.86618168 |
| Kurtosis | 2.9659474 |
| Mean | 139.59704 |
| Median Absolute Deviation (MAD) | 67 |
| Skewness | 1.5297719 |
| Sum | 697706 |
| Variance | 14620.775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 110 | 68 | 1.4% |
| 1 | 42 | 0.8% |
| 9 | 36 | 0.7% |
| 5 | 36 | 0.7% |
| 10 | 35 | 0.7% |
| 8 | 34 | 0.7% |
| 12 | 34 | 0.7% |
| 16 | 33 | 0.7% |
| 81 | 32 | 0.6% |
| 43 | 31 | 0.6% |
| Other values (518) | 4617 |
| Value | Count | Frequency (%) |
| 1 | 42 | |
| 2 | 26 | |
| 3 | 24 | |
| 4 | 29 | |
| 5 | 36 | |
| 6 | 28 | |
| 7 | 23 | |
| 8 | 34 | |
| 9 | 36 | |
| 10 | 35 |
| Value | Count | Frequency (%) |
| 813 | 1 | |
| 775 | 1 | |
| 765 | 1 | |
| 750 | 2 | |
| 739 | 1 | |
| 738 | 1 | |
| 733 | 1 | |
| 723 | 1 | |
| 712 | 1 | |
| 703 | 1 |
duration
Real number (ℝ)
| Distinct | 191 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.20068 |
| Minimum | 7 |
|---|---|
| Maximum | 511 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 81 |
| Q1 | 93 |
| median | 103 |
| Q3 | 118 |
| 95-th percentile | 146 |
| Maximum | 511 |
| Range | 504 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 25.211904 |
|---|---|
| Coefficient of variation (CV) | 0.23518418 |
| Kurtosis | 22.645561 |
| Mean | 107.20068 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 2.346182 |
| Sum | 535789 |
| Variance | 635.64013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 160 | 3.2% |
| 100 | 139 | 2.8% |
| 101 | 136 | 2.7% |
| 98 | 135 | 2.7% |
| 97 | 131 | 2.6% |
| 93 | 127 | 2.5% |
| 94 | 124 | 2.5% |
| 99 | 123 | 2.5% |
| 95 | 122 | 2.4% |
| 103 | 116 | 2.3% |
| Other values (181) | 3685 |
| Value | Count | Frequency (%) |
| 7 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 22 | 7 | |
| 23 | 2 | < 0.1% |
| 24 | 2 | < 0.1% |
| 25 | 4 | |
| 27 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 511 | 1 | |
| 334 | 1 | |
| 330 | 1 | |
| 325 | 1 | |
| 300 | 1 | |
| 293 | 1 | |
| 289 | 1 | |
| 286 | 1 | |
| 280 | 1 | |
| 271 | 1 |
director_facebook_likes
Real number (ℝ)
ZEROS 
| Distinct | 435 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 675.4964 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 897 |
| Zeros (%) | 17.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 49 |
| Q3 | 189 |
| 95-th percentile | 964 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 182 |
Descriptive statistics
| Standard deviation | 2793.8965 |
|---|---|
| Coefficient of variation (CV) | 4.1360642 |
| Kurtosis | 27.785224 |
| Mean | 675.4964 |
| Median Absolute Deviation (MAD) | 49 |
| Skewness | 5.2780979 |
| Sum | 3376131 |
| Variance | 7805857.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 897 | 17.9% |
| 49 | 124 | 2.5% |
| 3 | 70 | 1.4% |
| 6 | 66 | 1.3% |
| 7 | 64 | 1.3% |
| 2 | 63 | 1.3% |
| 4 | 60 | 1.2% |
| 11 | 58 | 1.2% |
| 10 | 53 | 1.1% |
| 5 | 52 | 1.0% |
| Other values (425) | 3491 |
| Value | Count | Frequency (%) |
| 0 | 897 | |
| 2 | 63 | 1.3% |
| 3 | 70 | 1.4% |
| 4 | 60 | 1.2% |
| 5 | 52 | 1.0% |
| 6 | 66 | 1.3% |
| 7 | 64 | 1.3% |
| 8 | 52 | 1.0% |
| 9 | 49 | 1.0% |
| 10 | 53 | 1.1% |
| Value | Count | Frequency (%) |
| 23000 | 1 | < 0.1% |
| 22000 | 8 | 0.2% |
| 21000 | 10 | 0.2% |
| 20000 | 1 | < 0.1% |
| 18000 | 4 | 0.1% |
| 17000 | 20 | |
| 16000 | 28 | |
| 15000 | 2 | < 0.1% |
| 14000 | 30 | |
| 13000 | 26 |
actor_3_facebook_likes
Real number (ℝ)
ZEROS 
| Distinct | 906 |
|---|---|
| Distinct (%) | 18.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 638.65426 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 89 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 134 |
| median | 369 |
| Q3 | 634.75 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 500.75 |
Descriptive statistics
| Standard deviation | 1639.6146 |
|---|---|
| Coefficient of variation (CV) | 2.5672961 |
| Kurtosis | 61.39995 |
| Mean | 638.65426 |
| Median Absolute Deviation (MAD) | 247 |
| Skewness | 7.3191584 |
| Sum | 3191994 |
| Variance | 2688336 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 125 | 2.5% |
| 0 | 89 | 1.8% |
| 11000 | 29 | 0.6% |
| 3 | 27 | 0.5% |
| 2000 | 27 | 0.5% |
| 3000 | 26 | 0.5% |
| 369 | 24 | 0.5% |
| 7 | 21 | 0.4% |
| 4 | 21 | 0.4% |
| 826 | 21 | 0.4% |
| Other values (896) | 4588 |
| Value | Count | Frequency (%) |
| 0 | 89 | |
| 2 | 21 | 0.4% |
| 3 | 27 | 0.5% |
| 4 | 21 | 0.4% |
| 5 | 18 | 0.4% |
| 6 | 18 | 0.4% |
| 7 | 21 | 0.4% |
| 8 | 17 | 0.3% |
| 9 | 16 | 0.3% |
| 10 | 12 | 0.2% |
| Value | Count | Frequency (%) |
| 23000 | 2 | < 0.1% |
| 20000 | 1 | < 0.1% |
| 19000 | 4 | 0.1% |
| 17000 | 1 | < 0.1% |
| 16000 | 3 | 0.1% |
| 15000 | 1 | < 0.1% |
| 14000 | 6 | 0.1% |
| 13000 | 5 | 0.1% |
| 12000 | 7 | 0.1% |
| 11000 | 29 |
actor_2_name
Text
| Distinct | 3033 |
|---|---|
| Distinct (%) | 60.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 25 |
| Mean length | 13.057423 |
| Min length | 3 |
Characters and Unicode
| Total characters | 65261 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2104 ? |
|---|---|
| Unique (%) | 42.1% |
Sample
| 1st row | Joel David Moore |
|---|---|
| 2nd row | Orlando Bloom |
| 3rd row | Rory Kinnear |
| 4th row | Christian Bale |
| 5th row | Rob Walker |
| Value | Count | Frequency (%) |
| michael | 102 | 1.0% |
| david | 59 | 0.6% |
| john | 56 | 0.5% |
| james | 53 | 0.5% |
| scott | 51 | 0.5% |
| tom | 50 | 0.5% |
| robert | 43 | 0.4% |
| jason | 43 | 0.4% |
| kevin | 41 | 0.4% |
| thomas | 39 | 0.4% |
| Other values (3826) | 9784 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6152 | 9.4% |
| a | 5882 | 9.0% |
| 5323 | 8.2% | |
| n | 4748 | 7.3% |
| r | 4365 | 6.7% |
| i | 3987 | 6.1% |
| o | 3632 | 5.6% |
| l | 3389 | 5.2% |
| t | 2326 | 3.6% |
| s | 2139 | 3.3% |
| Other values (70) | 23318 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49092 | |
| Uppercase Letter | 10591 | 16.2% |
| Space Separator | 5323 | 8.2% |
| Other Punctuation | 185 | 0.3% |
| Dash Punctuation | 64 | 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6152 | |
| a | 5882 | |
| n | 4748 | |
| r | 4365 | |
| i | 3987 | 8.1% |
| o | 3632 | 7.4% |
| l | 3389 | 6.9% |
| t | 2326 | 4.7% |
| s | 2139 | 4.4% |
| h | 1785 | 3.6% |
| Other values (38) | 10687 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 990 | 9.3% |
| S | 812 | 7.7% |
| C | 805 | 7.6% |
| B | 766 | 7.2% |
| J | 765 | 7.2% |
| D | 657 | 6.2% |
| A | 637 | 6.0% |
| R | 586 | 5.5% |
| L | 506 | 4.8% |
| T | 458 | 4.3% |
| Other values (16) | 3609 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 122 | |
| ' | 63 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 3 | |
| 0 | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 5323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 64 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59683 | |
| Common | 5578 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6152 | 10.3% |
| a | 5882 | 9.9% |
| n | 4748 | 8.0% |
| r | 4365 | 7.3% |
| i | 3987 | 6.7% |
| o | 3632 | 6.1% |
| l | 3389 | 5.7% |
| t | 2326 | 3.9% |
| s | 2139 | 3.6% |
| h | 1785 | 3.0% |
| Other values (64) | 21278 |
Common
| Value | Count | Frequency (%) |
| 5323 | ||
| . | 122 | 2.2% |
| - | 64 | 1.1% |
| ' | 63 | 1.1% |
| 5 | 3 | 0.1% |
| 0 | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65140 | |
| None | 121 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6152 | 9.4% |
| a | 5882 | 9.0% |
| 5323 | 8.2% | |
| n | 4748 | 7.3% |
| r | 4365 | 6.7% |
| i | 3987 | 6.1% |
| o | 3632 | 5.6% |
| l | 3389 | 5.2% |
| t | 2326 | 3.6% |
| s | 2139 | 3.3% |
| Other values (48) | 23197 |
None
| Value | Count | Frequency (%) |
| é | 43 | |
| í | 14 | 11.6% |
| á | 10 | 8.3% |
| ë | 8 | 6.6% |
| ø | 6 | 5.0% |
| ó | 6 | 5.0% |
| ü | 4 | 3.3% |
| å | 4 | 3.3% |
| û | 3 | 2.5% |
| ï | 3 | 2.5% |
| Other values (12) | 20 |
actor_1_facebook_likes
Real number (ℝ)
| Distinct | 878 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6549.1347 |
| Minimum | 0 |
|---|---|
| Maximum | 640000 |
| Zeros | 26 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 94 |
| Q1 | 613 |
| median | 984 |
| Q3 | 11000 |
| 95-th percentile | 24000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 10387 |
Descriptive statistics
| Standard deviation | 15052.477 |
|---|---|
| Coefficient of variation (CV) | 2.2983917 |
| Kurtosis | 683.01081 |
| Mean | 6549.1347 |
| Median Absolute Deviation (MAD) | 744.5 |
| Skewness | 19.144076 |
| Sum | 32732575 |
| Variance | 2.2657706 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 443 | 8.9% |
| 11000 | 209 | 4.2% |
| 2000 | 193 | 3.9% |
| 3000 | 152 | 3.0% |
| 12000 | 133 | 2.7% |
| 13000 | 127 | 2.5% |
| 14000 | 122 | 2.4% |
| 10000 | 112 | 2.2% |
| 18000 | 108 | 2.2% |
| 22000 | 81 | 1.6% |
| Other values (868) | 3318 |
| Value | Count | Frequency (%) |
| 0 | 26 | |
| 2 | 8 | 0.2% |
| 3 | 4 | 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 7 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 3 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 640000 | 1 | < 0.1% |
| 260000 | 3 | 0.1% |
| 164000 | 2 | < 0.1% |
| 137000 | 2 | < 0.1% |
| 87000 | 8 | 0.2% |
| 77000 | 1 | < 0.1% |
| 49000 | 27 | |
| 46000 | 1 | < 0.1% |
| 45000 | 5 | 0.1% |
| 44000 | 2 | < 0.1% |
gross
Real number (ℝ)
| Distinct | 4036 |
|---|---|
| Distinct (%) | 80.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44324642 |
| Minimum | 162 |
|---|---|
| Maximum | 7.6050585 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 126085.3 |
| Q1 | 8382841.2 |
| median | 25445749 |
| Q3 | 51376923 |
| 95-th percentile | 1.6611752 × 108 |
| Maximum | 7.6050585 × 108 |
| Range | 7.6050568 × 108 |
| Interquartile range (IQR) | 42994082 |
Descriptive statistics
| Standard deviation | 62344554 |
|---|---|
| Coefficient of variation (CV) | 1.4065439 |
| Kurtosis | 18.006554 |
| Mean | 44324642 |
| Median Absolute Deviation (MAD) | 19334093 |
| Skewness | 3.4652646 |
| Sum | 2.2153456 × 1011 |
| Variance | 3.8868434 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25445749 | 874 | 17.5% |
| 177343675 | 3 | 0.1% |
| 218051260 | 3 | 0.1% |
| 8000000 | 3 | 0.1% |
| 3000000 | 3 | 0.1% |
| 25000000 | 2 | < 0.1% |
| 10654581 | 2 | < 0.1% |
| 21378000 | 2 | < 0.1% |
| 22494487 | 2 | < 0.1% |
| 26505000 | 2 | < 0.1% |
| Other values (4026) | 4102 |
| Value | Count | Frequency (%) |
| 162 | 1 | |
| 703 | 1 | |
| 721 | 1 | |
| 728 | 1 | |
| 828 | 1 | |
| 1111 | 1 | |
| 1332 | 1 | |
| 1521 | 1 | |
| 1711 | 1 | |
| 2245 | 1 |
| Value | Count | Frequency (%) |
| 760505847 | 1 | |
| 658672302 | 1 | |
| 652177271 | 1 | |
| 623279547 | 1 | |
| 533316061 | 1 | |
| 474544677 | 1 | |
| 460935665 | 1 | |
| 458991599 | 1 | |
| 448130642 | 1 | |
| 436471036 | 1 |
genres
Text
| Distinct | 914 |
|---|---|
| Distinct (%) | 18.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 53 |
| Mean length | 20.32433 |
| Min length | 5 |
Characters and Unicode
| Total characters | 101581 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 499 ? |
|---|---|
| Unique (%) | 10.0% |
Sample
| 1st row | Action|Adventure|Fantasy|Sci-Fi |
|---|---|
| 2nd row | Action|Adventure|Fantasy |
| 3rd row | Action|Adventure|Thriller |
| 4th row | Action|Thriller |
| 5th row | Documentary |
| Value | Count | Frequency (%) |
| drama | 235 | 4.7% |
| comedy | 205 | 4.1% |
| comedy|drama | 189 | 3.8% |
| comedy|drama|romance | 187 | 3.7% |
| comedy|romance | 158 | 3.2% |
| drama|romance | 151 | 3.0% |
| crime|drama|thriller | 100 | 2.0% |
| horror | 70 | 1.4% |
| action|crime|drama|thriller | 67 | 1.3% |
| drama|thriller | 64 | 1.3% |
| Other values (904) | 3572 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 10436 | 10.3% |
| | | 9384 | 9.2% |
| a | 8993 | 8.9% |
| e | 7875 | 7.8% |
| m | 7328 | 7.2% |
| i | 6527 | 6.4% |
| o | 6268 | 6.2% |
| y | 4616 | 4.5% |
| n | 4458 | 4.4% |
| t | 4004 | 3.9% |
| Other values (25) | 31692 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 76573 | |
| Uppercase Letter | 15004 | 14.8% |
| Math Symbol | 9384 | 9.2% |
| Dash Punctuation | 620 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 10436 | |
| a | 8993 | |
| e | 7875 | |
| m | 7328 | |
| i | 6527 | |
| o | 6268 | |
| y | 4616 | 6.0% |
| n | 4458 | 5.8% |
| t | 4004 | 5.2% |
| l | 3476 | 4.5% |
| Other values (9) | 12592 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2745 | |
| D | 2692 | |
| A | 2299 | |
| F | 1765 | |
| T | 1398 | |
| R | 1100 | |
| M | 837 | 5.6% |
| S | 798 | 5.3% |
| H | 761 | 5.1% |
| W | 305 | 2.0% |
| Other values (4) | 304 | 2.0% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 9384 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91577 | |
| Common | 10004 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 10436 | 11.4% |
| a | 8993 | 9.8% |
| e | 7875 | 8.6% |
| m | 7328 | 8.0% |
| i | 6527 | 7.1% |
| o | 6268 | 6.8% |
| y | 4616 | 5.0% |
| n | 4458 | 4.9% |
| t | 4004 | 4.4% |
| l | 3476 | 3.8% |
| Other values (23) | 27596 |
Common
| Value | Count | Frequency (%) |
| | | 9384 | |
| - | 620 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 101581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 10436 | 10.3% |
| | | 9384 | 9.2% |
| a | 8993 | 8.9% |
| e | 7875 | 7.8% |
| m | 7328 | 7.2% |
| i | 6527 | 6.4% |
| o | 6268 | 6.2% |
| y | 4616 | 4.5% |
| n | 4458 | 4.4% |
| t | 4004 | 3.9% |
| Other values (25) | 31692 |
actor_1_name
Text
| Distinct | 2098 |
|---|---|
| Distinct (%) | 42.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 24 |
| Mean length | 13.185274 |
| Min length | 4 |
Characters and Unicode
| Total characters | 65900 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1369 ? |
|---|---|
| Unique (%) | 27.4% |
Sample
| 1st row | CCH Pounder |
|---|---|
| 2nd row | Johnny Depp |
| 3rd row | Christoph Waltz |
| 4th row | Tom Hardy |
| 5th row | Doug Walker |
| Value | Count | Frequency (%) |
| robert | 108 | 1.0% |
| tom | 92 | 0.9% |
| michael | 89 | 0.9% |
| de | 57 | 0.6% |
| jason | 57 | 0.6% |
| james | 54 | 0.5% |
| bruce | 51 | 0.5% |
| steve | 50 | 0.5% |
| niro | 49 | 0.5% |
| jr | 48 | 0.5% |
| Other values (2889) | 9702 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6157 | 9.3% |
| a | 5692 | 8.6% |
| 5359 | 8.1% | |
| n | 4797 | 7.3% |
| r | 4278 | 6.5% |
| i | 4204 | 6.4% |
| o | 3891 | 5.9% |
| l | 3287 | 5.0% |
| t | 2546 | 3.9% |
| s | 2326 | 3.5% |
| Other values (66) | 23363 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49627 | |
| Uppercase Letter | 10615 | 16.1% |
| Space Separator | 5359 | 8.1% |
| Other Punctuation | 225 | 0.3% |
| Dash Punctuation | 72 | 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6157 | |
| a | 5692 | |
| n | 4797 | |
| r | 4278 | |
| i | 4204 | 8.5% |
| o | 3891 | 7.8% |
| l | 3287 | 6.6% |
| t | 2546 | 5.1% |
| s | 2326 | 4.7% |
| h | 1770 | 3.6% |
| Other values (32) | 10679 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 940 | 8.9% |
| M | 904 | 8.5% |
| S | 845 | 8.0% |
| C | 808 | 7.6% |
| B | 738 | 7.0% |
| D | 718 | 6.8% |
| R | 628 | 5.9% |
| H | 519 | 4.9% |
| A | 499 | 4.7% |
| L | 487 | 4.6% |
| Other values (18) | 3529 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 178 | |
| ' | 47 | 20.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5359 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 72 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 60242 | |
| Common | 5658 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6157 | 10.2% |
| a | 5692 | 9.4% |
| n | 4797 | 8.0% |
| r | 4278 | 7.1% |
| i | 4204 | 7.0% |
| o | 3891 | 6.5% |
| l | 3287 | 5.5% |
| t | 2546 | 4.2% |
| s | 2326 | 3.9% |
| h | 1770 | 2.9% |
| Other values (60) | 21294 |
Common
| Value | Count | Frequency (%) |
| 5359 | ||
| . | 178 | 3.1% |
| - | 72 | 1.3% |
| ' | 47 | 0.8% |
| 5 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65821 | |
| None | 79 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6157 | 9.4% |
| a | 5692 | 8.6% |
| 5359 | 8.1% | |
| n | 4797 | 7.3% |
| r | 4278 | 6.5% |
| i | 4204 | 6.4% |
| o | 3891 | 5.9% |
| l | 3287 | 5.0% |
| t | 2546 | 3.9% |
| s | 2326 | 3.5% |
| Other values (48) | 23284 |
None
| Value | Count | Frequency (%) |
| é | 19 | |
| ë | 15 | |
| á | 7 | 8.9% |
| í | 6 | 7.6% |
| å | 5 | 6.3% |
| ç | 5 | 6.3% |
| ø | 4 | 5.1% |
| Ó | 3 | 3.8% |
| ü | 2 | 2.5% |
| Á | 2 | 2.5% |
| Other values (8) | 11 |
movie_title
Text
| Distinct | 4916 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 86 |
|---|---|
| Median length | 58 |
| Mean length | 15.309324 |
| Min length | 1 |
Characters and Unicode
| Total characters | 76516 |
|---|---|
| Distinct characters | 96 |
| Distinct categories | 13 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4837 ? |
|---|---|
| Unique (%) | 96.8% |
Sample
| 1st row | Avatar |
|---|---|
| 2nd row | Pirates of the Caribbean: At World's End |
| 3rd row | Spectre |
| 4th row | The Dark Knight Rises |
| 5th row | Star Wars: Episode VII - The Force Awakens |
| Value | Count | Frequency (%) |
| the | 1591 | 11.5% |
| of | 480 | 3.5% |
| a | 188 | 1.4% |
| and | 148 | 1.1% |
| in | 123 | 0.9% |
| to | 106 | 0.8% |
| 2 | 103 | 0.7% |
| 80 | 0.6% | |
| man | 66 | 0.5% |
| love | 55 | 0.4% |
| Other values (4905) | 10906 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8848 | 11.6% | |
| e | 7839 | 10.2% |
| a | 4807 | 6.3% |
| o | 4628 | 6.0% |
| n | 4104 | 5.4% |
| r | 4103 | 5.4% |
| i | 3907 | 5.1% |
| t | 3788 | 5.0% |
| s | 2981 | 3.9% |
| h | 2952 | 3.9% |
| Other values (86) | 28559 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53945 | |
| Uppercase Letter | 12136 | 15.9% |
| Space Separator | 8848 | 11.6% |
| Other Punctuation | 948 | 1.2% |
| Decimal Number | 525 | 0.7% |
| Dash Punctuation | 94 | 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Currency Symbol | 4 | < 0.1% |
| Other Number | 2 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7839 | |
| a | 4807 | 8.9% |
| o | 4628 | 8.6% |
| n | 4104 | 7.6% |
| r | 4103 | 7.6% |
| i | 3907 | 7.2% |
| t | 3788 | 7.0% |
| s | 2981 | 5.5% |
| h | 2952 | 5.5% |
| l | 2509 | 4.7% |
| Other values (25) | 12327 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1707 | |
| S | 1048 | 8.6% |
| M | 818 | 6.7% |
| B | 773 | 6.4% |
| D | 722 | 5.9% |
| C | 681 | 5.6% |
| A | 660 | 5.4% |
| L | 573 | 4.7% |
| H | 561 | 4.6% |
| W | 502 | 4.1% |
| Other values (17) | 4091 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 369 | |
| ' | 230 | |
| . | 145 | 15.3% |
| , | 78 | 8.2% |
| & | 61 | 6.4% |
| ! | 32 | 3.4% |
| ? | 16 | 1.7% |
| / | 8 | 0.8% |
| * | 5 | 0.5% |
| # | 2 | 0.2% |
| Other values (2) | 2 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 146 | |
| 3 | 87 | |
| 0 | 86 | |
| 1 | 82 | |
| 4 | 35 | 6.7% |
| 8 | 22 | 4.2% |
| 5 | 21 | 4.0% |
| 9 | 17 | 3.2% |
| 7 | 15 | 2.9% |
| 6 | 14 | 2.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 | |
| [ | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 | |
| ] | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 8848 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 94 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 66081 | |
| Common | 10435 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7839 | 11.9% |
| a | 4807 | 7.3% |
| o | 4628 | 7.0% |
| n | 4104 | 6.2% |
| r | 4103 | 6.2% |
| i | 3907 | 5.9% |
| t | 3788 | 5.7% |
| s | 2981 | 4.5% |
| h | 2952 | 4.5% |
| l | 2509 | 3.8% |
| Other values (52) | 24463 |
Common
| Value | Count | Frequency (%) |
| 8848 | ||
| : | 369 | 3.5% |
| ' | 230 | 2.2% |
| 2 | 146 | 1.4% |
| . | 145 | 1.4% |
| - | 94 | 0.9% |
| 3 | 87 | 0.8% |
| 0 | 86 | 0.8% |
| 1 | 82 | 0.8% |
| , | 78 | 0.7% |
| Other values (24) | 270 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76493 | |
| None | 23 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8848 | 11.6% | |
| e | 7839 | 10.2% |
| a | 4807 | 6.3% |
| o | 4628 | 6.1% |
| n | 4104 | 5.4% |
| r | 4103 | 5.4% |
| i | 3907 | 5.1% |
| t | 3788 | 5.0% |
| s | 2981 | 3.9% |
| h | 2952 | 3.9% |
| Other values (72) | 28536 |
None
| Value | Count | Frequency (%) |
| é | 8 | |
| ½ | 2 | 8.7% |
| ¢ | 2 | 8.7% |
| ü | 1 | 4.3% |
| è | 1 | 4.3% |
| · | 1 | 4.3% |
| à | 1 | 4.3% |
| Æ | 1 | 4.3% |
| ä | 1 | 4.3% |
| á | 1 | 4.3% |
| Other values (4) | 4 |
num_voted_users
Real number (ℝ)
| Distinct | 4826 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83470.199 |
| Minimum | 5 |
|---|---|
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 511.55 |
| Q1 | 8560 |
| median | 34260.5 |
| Q3 | 96120.75 |
| 95-th percentile | 332096.65 |
| Maximum | 1689764 |
| Range | 1689759 |
| Interquartile range (IQR) | 87560.75 |
Descriptive statistics
| Standard deviation | 138086.56 |
|---|---|
| Coefficient of variation (CV) | 1.6543217 |
| Kurtosis | 24.611289 |
| Mean | 83470.199 |
| Median Absolute Deviation (MAD) | 30735 |
| Skewness | 4.0349324 |
| Sum | 4.1718405 × 108 |
| Variance | 1.9067898 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 57 | 5 | 0.1% |
| 6 | 4 | 0.1% |
| 2541 | 3 | 0.1% |
| 38 | 3 | 0.1% |
| 162 | 3 | 0.1% |
| 62 | 3 | 0.1% |
| 3665 | 3 | 0.1% |
| 8 | 3 | 0.1% |
| 53 | 3 | 0.1% |
| 3119 | 3 | 0.1% |
| Other values (4816) | 4965 |
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 6 | 4 | |
| 7 | 2 | |
| 8 | 3 | |
| 10 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 2 | |
| 16 | 1 | < 0.1% |
| 18 | 2 | |
| 19 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1689764 | 1 | |
| 1676169 | 1 | |
| 1468200 | 1 | |
| 1347461 | 1 | |
| 1324680 | 1 | |
| 1251222 | 1 | |
| 1238746 | 1 | |
| 1217752 | 1 | |
| 1215718 | 1 | |
| 1155770 | 1 |
cast_total_facebook_likes
Real number (ℝ)
| Distinct | 3978 |
|---|---|
| Distinct (%) | 79.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9676.9412 |
| Minimum | 0 |
|---|---|
| Maximum | 656730 |
| Zeros | 33 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 175.85 |
| Q1 | 1405.5 |
| median | 3085.5 |
| Q3 | 13740.5 |
| 95-th percentile | 36892.75 |
| Maximum | 656730 |
| Range | 656730 |
| Interquartile range (IQR) | 12335 |
Descriptive statistics
| Standard deviation | 18165.405 |
|---|---|
| Coefficient of variation (CV) | 1.8771846 |
| Kurtosis | 364.38585 |
| Mean | 9676.9412 |
| Median Absolute Deviation (MAD) | 2302.5 |
| Skewness | 12.923944 |
| Sum | 48365352 |
| Variance | 3.2998192 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33 | 0.7% |
| 5 | 7 | 0.1% |
| 2 | 6 | 0.1% |
| 2020 | 6 | 0.1% |
| 29 | 5 | 0.1% |
| 1044 | 5 | 0.1% |
| 673 | 5 | 0.1% |
| 1233 | 4 | 0.1% |
| 2251 | 4 | 0.1% |
| 2990 | 4 | 0.1% |
| Other values (3968) | 4919 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 2 | 6 | 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 7 | 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 656730 | 1 | |
| 303717 | 1 | |
| 283939 | 1 | |
| 263584 | 1 | |
| 261818 | 1 | |
| 170118 | 1 | |
| 140268 | 1 | |
| 137712 | 1 | |
| 120797 | 1 | |
| 108016 | 1 |
actor_3_name
Text
| Distinct | 3522 |
|---|---|
| Distinct (%) | 70.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 25 |
| Mean length | 13.052621 |
| Min length | 3 |
Characters and Unicode
| Total characters | 65237 |
|---|---|
| Distinct characters | 81 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2669 ? |
|---|---|
| Unique (%) | 53.4% |
Sample
| 1st row | Wes Studi |
|---|---|
| 2nd row | Jack Davenport |
| 3rd row | Stephanie Sigman |
| 4th row | Joseph Gordon-Levitt |
| 5th row | unknown |
| Value | Count | Frequency (%) |
| michael | 85 | 0.8% |
| john | 78 | 0.8% |
| david | 70 | 0.7% |
| james | 69 | 0.7% |
| robert | 46 | 0.4% |
| tom | 43 | 0.4% |
| kevin | 41 | 0.4% |
| paul | 41 | 0.4% |
| peter | 38 | 0.4% |
| steve | 36 | 0.3% |
| Other values (4308) | 9775 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6135 | 9.4% |
| a | 5948 | 9.1% |
| 5324 | 8.2% | |
| n | 4619 | 7.1% |
| r | 4144 | 6.4% |
| i | 3939 | 6.0% |
| o | 3565 | 5.5% |
| l | 3475 | 5.3% |
| t | 2336 | 3.6% |
| s | 2312 | 3.5% |
| Other values (71) | 23440 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49008 | |
| Uppercase Letter | 10593 | 16.2% |
| Space Separator | 5324 | 8.2% |
| Other Punctuation | 231 | 0.4% |
| Dash Punctuation | 79 | 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6135 | |
| a | 5948 | |
| n | 4619 | |
| r | 4144 | 8.5% |
| i | 3939 | 8.0% |
| o | 3565 | 7.3% |
| l | 3475 | 7.1% |
| t | 2336 | 4.8% |
| s | 2312 | 4.7% |
| h | 1838 | 3.8% |
| Other values (34) | 10697 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 974 | 9.2% |
| S | 822 | 7.8% |
| J | 822 | 7.8% |
| B | 799 | 7.5% |
| C | 785 | 7.4% |
| D | 648 | 6.1% |
| R | 612 | 5.8% |
| A | 584 | 5.5% |
| L | 530 | 5.0% |
| K | 461 | 4.4% |
| Other values (21) | 3556 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 168 | |
| ' | 63 | 27.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5324 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 79 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59601 | |
| Common | 5636 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6135 | 10.3% |
| a | 5948 | 10.0% |
| n | 4619 | 7.7% |
| r | 4144 | 7.0% |
| i | 3939 | 6.6% |
| o | 3565 | 6.0% |
| l | 3475 | 5.8% |
| t | 2336 | 3.9% |
| s | 2312 | 3.9% |
| h | 1838 | 3.1% |
| Other values (65) | 21290 |
Common
| Value | Count | Frequency (%) |
| 5324 | ||
| . | 168 | 3.0% |
| - | 79 | 1.4% |
| ' | 63 | 1.1% |
| 5 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65103 | |
| None | 134 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6135 | 9.4% |
| a | 5948 | 9.1% |
| 5324 | 8.2% | |
| n | 4619 | 7.1% |
| r | 4144 | 6.4% |
| i | 3939 | 6.1% |
| o | 3565 | 5.5% |
| l | 3475 | 5.3% |
| t | 2336 | 3.6% |
| s | 2312 | 3.6% |
| Other values (48) | 23306 |
None
| Value | Count | Frequency (%) |
| é | 49 | |
| í | 14 | 10.4% |
| á | 13 | 9.7% |
| ó | 9 | 6.7% |
| ü | 7 | 5.2% |
| ë | 7 | 5.2% |
| à | 5 | 3.7% |
| è | 4 | 3.0% |
| ç | 3 | 2.2% |
| ô | 3 | 2.2% |
| Other values (13) | 20 |
plot_keywords
Text
| Distinct | 4761 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 149 |
|---|---|
| Median length | 102 |
| Mean length | 50.960984 |
| Min length | 2 |
Characters and Unicode
| Total characters | 254703 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4679 ? |
|---|---|
| Unique (%) | 93.6% |
Sample
| 1st row | avatar|future|marine|native|paraplegic |
|---|---|
| 2nd row | goddess|marriage ceremony|marriage proposal|pirate|singapore |
| 3rd row | bomb|espionage|sequel|spy|terrorist |
| 4th row | deception|imprisonment|lawlessness|police officer|terrorist plot |
| 5th row | none |
| Value | Count | Frequency (%) |
| in | 331 | 1.8% |
| of | 219 | 1.2% |
| on | 209 | 1.2% |
| the | 189 | 1.0% |
| a | 183 | 1.0% |
| to | 176 | 1.0% |
| none | 152 | 0.8% |
| york | 122 | 0.7% |
| based | 106 | 0.6% |
| female | 104 | 0.6% |
| Other values (11487) | 16224 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24738 | 9.7% |
| a | 19403 | 7.6% |
| | | 19035 | 7.5% |
| i | 18589 | 7.3% |
| r | 17953 | 7.0% |
| t | 16049 | 6.3% |
| n | 15836 | 6.2% |
| o | 15492 | 6.1% |
| s | 13176 | 5.2% |
| 13017 | 5.1% | |
| Other values (32) | 81415 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 221304 | |
| Math Symbol | 19035 | 7.5% |
| Space Separator | 13017 | 5.1% |
| Decimal Number | 1127 | 0.4% |
| Other Punctuation | 218 | 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24738 | |
| a | 19403 | 8.8% |
| i | 18589 | 8.4% |
| r | 17953 | 8.1% |
| t | 16049 | 7.3% |
| n | 15836 | 7.2% |
| o | 15492 | 7.0% |
| s | 13176 | 6.0% |
| l | 11079 | 5.0% |
| c | 9377 | 4.2% |
| Other values (16) | 59612 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 283 | |
| 0 | 269 | |
| 9 | 221 | |
| 2 | 81 | 7.2% |
| 8 | 65 | 5.8% |
| 7 | 49 | 4.3% |
| 5 | 47 | 4.2% |
| 3 | 44 | 3.9% |
| 6 | 38 | 3.4% |
| 4 | 30 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 130 | |
| ' | 88 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 19035 |
Space Separator
| Value | Count | Frequency (%) |
| 13017 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 221304 | |
| Common | 33399 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24738 | |
| a | 19403 | 8.8% |
| i | 18589 | 8.4% |
| r | 17953 | 8.1% |
| t | 16049 | 7.3% |
| n | 15836 | 7.2% |
| o | 15492 | 7.0% |
| s | 13176 | 6.0% |
| l | 11079 | 5.0% |
| c | 9377 | 4.2% |
| Other values (16) | 59612 |
Common
| Value | Count | Frequency (%) |
| | | 19035 | |
| 13017 | ||
| 1 | 283 | 0.8% |
| 0 | 269 | 0.8% |
| 9 | 221 | 0.7% |
| . | 130 | 0.4% |
| ' | 88 | 0.3% |
| 2 | 81 | 0.2% |
| 8 | 65 | 0.2% |
| 7 | 49 | 0.1% |
| Other values (6) | 161 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 254703 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 24738 | 9.7% |
| a | 19403 | 7.6% |
| | | 19035 | 7.5% |
| i | 18589 | 7.3% |
| r | 17953 | 7.0% |
| t | 16049 | 6.3% |
| n | 15836 | 6.2% |
| o | 15492 | 6.1% |
| s | 13176 | 5.2% |
| 13017 | 5.1% | |
| Other values (32) | 81415 |
movie_imdb_link
Text
| Distinct | 4919 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 52 |
|---|---|
| Median length | 52 |
| Mean length | 52 |
| Min length | 52 |
Characters and Unicode
| Total characters | 259896 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4843 ? |
|---|---|
| Unique (%) | 96.9% |
Sample
| 1st row | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 |
|---|---|
| 2nd row | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 |
| 3rd row | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 |
| 4th row | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 |
| 5th row | http://www.imdb.com/title/tt5289954/?ref_=fn_tt_tt_1 |
| Value | Count | Frequency (%) |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt2638144/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0075005/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt0364725/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt0929632/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt4178092/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt0397065/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| http://www.imdb.com/title/tt0072271/?ref_=fn_tt_tt_1 | 2 | < 0.1% |
| Other values (4909) | 4975 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 49980 | |
| / | 24990 | 9.6% |
| _ | 19992 | 7.7% |
| w | 14994 | 5.8% |
| . | 9996 | 3.8% |
| m | 9996 | 3.8% |
| e | 9996 | 3.8% |
| f | 9996 | 3.8% |
| i | 9996 | 3.8% |
| 1 | 9814 | 3.8% |
| Other values (21) | 90146 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 149940 | |
| Other Punctuation | 44982 | 17.3% |
| Decimal Number | 39984 | 15.4% |
| Connector Punctuation | 19992 | 7.7% |
| Math Symbol | 4998 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 49980 | |
| w | 14994 | 10.0% |
| m | 9996 | 6.7% |
| e | 9996 | 6.7% |
| f | 9996 | 6.7% |
| i | 9996 | 6.7% |
| p | 4998 | 3.3% |
| h | 4998 | 3.3% |
| d | 4998 | 3.3% |
| l | 4998 | 3.3% |
| Other values (5) | 24990 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9814 | |
| 0 | 6755 | |
| 2 | 3630 | 9.1% |
| 3 | 3220 | 8.1% |
| 4 | 3150 | 7.9% |
| 8 | 2887 | 7.2% |
| 9 | 2702 | 6.8% |
| 6 | 2696 | 6.7% |
| 7 | 2672 | 6.7% |
| 5 | 2458 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 24990 | |
| . | 9996 | 22.2% |
| : | 4998 | 11.1% |
| ? | 4998 | 11.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19992 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 4998 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149940 | |
| Common | 109956 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 24990 | |
| _ | 19992 | |
| . | 9996 | 9.1% |
| 1 | 9814 | 8.9% |
| 0 | 6755 | 6.1% |
| : | 4998 | 4.5% |
| ? | 4998 | 4.5% |
| = | 4998 | 4.5% |
| 2 | 3630 | 3.3% |
| 3 | 3220 | 2.9% |
| Other values (6) | 16565 |
Latin
| Value | Count | Frequency (%) |
| t | 49980 | |
| w | 14994 | 10.0% |
| m | 9996 | 6.7% |
| e | 9996 | 6.7% |
| f | 9996 | 6.7% |
| i | 9996 | 6.7% |
| p | 4998 | 3.3% |
| h | 4998 | 3.3% |
| d | 4998 | 3.3% |
| l | 4998 | 3.3% |
| Other values (5) | 24990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 259896 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 49980 | |
| / | 24990 | 9.6% |
| _ | 19992 | 7.7% |
| w | 14994 | 5.8% |
| . | 9996 | 3.8% |
| m | 9996 | 3.8% |
| e | 9996 | 3.8% |
| f | 9996 | 3.8% |
| i | 9996 | 3.8% |
| 1 | 9814 | 3.8% |
| Other values (21) | 90146 |
num_user_for_reviews
Real number (ℝ)
| Distinct | 954 |
|---|---|
| Distinct (%) | 19.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 271.52721 |
| Minimum | 1 |
|---|---|
| Maximum | 5060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 65 |
| median | 156 |
| Q3 | 323 |
| 95-th percentile | 902.15 |
| Maximum | 5060 |
| Range | 5059 |
| Interquartile range (IQR) | 258 |
Descriptive statistics
| Standard deviation | 377.05627 |
|---|---|
| Coefficient of variation (CV) | 1.38865 |
| Kurtosis | 26.837621 |
| Mean | 271.52721 |
| Median Absolute Deviation (MAD) | 112 |
| Skewness | 4.1555332 |
| Sum | 1357093 |
| Variance | 142171.43 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 51 | 1.0% |
| 156 | 36 | 0.7% |
| 3 | 33 | 0.7% |
| 26 | 32 | 0.6% |
| 2 | 32 | 0.6% |
| 10 | 29 | 0.6% |
| 6 | 28 | 0.6% |
| 50 | 26 | 0.5% |
| 8 | 25 | 0.5% |
| 32 | 25 | 0.5% |
| Other values (944) | 4681 |
| Value | Count | Frequency (%) |
| 1 | 51 | |
| 2 | 32 | |
| 3 | 33 | |
| 4 | 23 | |
| 5 | 19 | 0.4% |
| 6 | 28 | |
| 7 | 17 | 0.3% |
| 8 | 25 | |
| 9 | 23 | |
| 10 | 29 |
| Value | Count | Frequency (%) |
| 5060 | 1 | |
| 4667 | 1 | |
| 4144 | 1 | |
| 3646 | 1 | |
| 3597 | 1 | |
| 3516 | 1 | |
| 3400 | 1 | |
| 3286 | 1 | |
| 3189 | 1 | |
| 3054 | 1 |
language
Categorical
IMBALANCE 
| Distinct | 46 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| English | |
|---|---|
| French | 73 |
| Spanish | 40 |
| Hindi | 28 |
| Mandarin | 24 |
| Other values (41) | 157 |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.9811925 |
| Min length | 4 |
Characters and Unicode
| Total characters | 34892 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | English |
|---|---|
| 2nd row | English |
| 3rd row | English |
| 4th row | English |
| 5th row | English |
Common Values
| Value | Count | Frequency (%) |
| English | 4676 | |
| French | 73 | 1.5% |
| Spanish | 40 | 0.8% |
| Hindi | 28 | 0.6% |
| Mandarin | 24 | 0.5% |
| German | 19 | 0.4% |
| Japanese | 17 | 0.3% |
| Cantonese | 11 | 0.2% |
| Italian | 11 | 0.2% |
| Russian | 11 | 0.2% |
| Other values (36) | 88 | 1.8% |
Length
| Value | Count | Frequency (%) |
| english | 4676 | |
| french | 73 | 1.5% |
| spanish | 40 | 0.8% |
| hindi | 28 | 0.6% |
| mandarin | 24 | 0.5% |
| german | 19 | 0.4% |
| japanese | 17 | 0.3% |
| cantonese | 11 | 0.2% |
| italian | 11 | 0.2% |
| russian | 11 | 0.2% |
| Other values (36) | 88 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4997 | |
| i | 4876 | |
| h | 4817 | |
| s | 4799 | |
| l | 4703 | |
| g | 4694 | |
| E | 4676 | |
| a | 246 | 0.7% |
| e | 213 | 0.6% |
| r | 158 | 0.5% |
| Other values (33) | 713 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29894 | |
| Uppercase Letter | 4998 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 4997 | |
| i | 4876 | |
| h | 4817 | |
| s | 4799 | |
| l | 4703 | |
| g | 4694 | |
| a | 246 | 0.8% |
| e | 213 | 0.7% |
| r | 158 | 0.5% |
| c | 88 | 0.3% |
| Other values (13) | 303 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4676 | |
| F | 74 | 1.5% |
| S | 47 | 0.9% |
| H | 34 | 0.7% |
| M | 26 | 0.5% |
| G | 20 | 0.4% |
| J | 17 | 0.3% |
| P | 17 | 0.3% |
| C | 15 | 0.3% |
| I | 15 | 0.3% |
| Other values (10) | 57 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34892 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 4997 | |
| i | 4876 | |
| h | 4817 | |
| s | 4799 | |
| l | 4703 | |
| g | 4694 | |
| E | 4676 | |
| a | 246 | 0.7% |
| e | 213 | 0.6% |
| r | 158 | 0.5% |
| Other values (33) | 713 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34892 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 4997 | |
| i | 4876 | |
| h | 4817 | |
| s | 4799 | |
| l | 4703 | |
| g | 4694 | |
| E | 4676 | |
| a | 246 | 0.7% |
| e | 213 | 0.6% |
| r | 158 | 0.5% |
| Other values (33) | 713 | 2.0% |
country
Text
| Distinct | 65 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 3 |
| Mean length | 3.4909964 |
| Min length | 2 |
Characters and Unicode
| Total characters | 17448 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | UK |
| 4th row | USA |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 3778 | |
| uk | 443 | 8.7% |
| france | 154 | 3.0% |
| canada | 124 | 2.4% |
| germany | 99 | 2.0% |
| australia | 55 | 1.1% |
| india | 34 | 0.7% |
| spain | 33 | 0.7% |
| china | 28 | 0.6% |
| italy | 23 | 0.5% |
| Other values (63) | 293 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 4223 | |
| A | 3848 | |
| S | 3845 | |
| a | 1082 | 6.2% |
| n | 632 | 3.6% |
| K | 476 | 2.7% |
| e | 409 | 2.3% |
| r | 403 | 2.3% |
| i | 247 | 1.4% |
| d | 216 | 1.2% |
| Other values (37) | 2067 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13062 | |
| Lowercase Letter | 4320 | 24.8% |
| Space Separator | 66 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1082 | |
| n | 632 | |
| e | 409 | 9.5% |
| r | 403 | 9.3% |
| i | 247 | 5.7% |
| d | 216 | 5.0% |
| c | 193 | 4.5% |
| l | 154 | 3.6% |
| y | 138 | 3.2% |
| m | 125 | 2.9% |
| Other values (14) | 721 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4223 | |
| A | 3848 | |
| S | 3845 | |
| K | 476 | 3.6% |
| C | 159 | 1.2% |
| F | 155 | 1.2% |
| G | 102 | 0.8% |
| I | 81 | 0.6% |
| N | 30 | 0.2% |
| J | 22 | 0.2% |
| Other values (12) | 121 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 66 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17382 | |
| Common | 66 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 4223 | |
| A | 3848 | |
| S | 3845 | |
| a | 1082 | 6.2% |
| n | 632 | 3.6% |
| K | 476 | 2.7% |
| e | 409 | 2.4% |
| r | 403 | 2.3% |
| i | 247 | 1.4% |
| d | 216 | 1.2% |
| Other values (36) | 2001 |
Common
| Value | Count | Frequency (%) |
| 66 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 4223 | |
| A | 3848 | |
| S | 3845 | |
| a | 1082 | 6.2% |
| n | 632 | 3.6% |
| K | 476 | 2.7% |
| e | 409 | 2.3% |
| r | 403 | 2.3% |
| i | 247 | 1.4% |
| d | 216 | 1.2% |
| Other values (37) | 2067 |
content_rating
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| R | |
|---|---|
| PG-13 | |
| PG | |
| Not Rated | |
| G | 112 |
| Other values (13) |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 3.1846739 |
| Min length | 1 |
Characters and Unicode
| Total characters | 15917 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PG-13 |
|---|---|
| 2nd row | PG-13 |
| 3rd row | PG-13 |
| 4th row | PG-13 |
| 5th row | Not Rated |
Common Values
| Value | Count | Frequency (%) |
| R | 2098 | |
| PG-13 | 1444 | |
| PG | 698 | 14.0% |
| Not Rated | 417 | 8.3% |
| G | 112 | 2.2% |
| Unrated | 60 | 1.2% |
| Approved | 55 | 1.1% |
| TV-14 | 30 | 0.6% |
| TV-MA | 19 | 0.4% |
| TV-PG | 13 | 0.3% |
| Other values (8) | 52 | 1.0% |
Length
| Value | Count | Frequency (%) |
| r | 2098 | |
| pg-13 | 1444 | |
| pg | 698 | 12.9% |
| not | 417 | 7.7% |
| rated | 417 | 7.7% |
| g | 112 | 2.1% |
| unrated | 60 | 1.1% |
| approved | 55 | 1.0% |
| tv-14 | 30 | 0.6% |
| tv-ma | 19 | 0.4% |
| Other values (9) | 65 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 2515 | |
| G | 2283 | |
| P | 2170 | |
| - | 1525 | |
| 1 | 1481 | |
| 3 | 1444 | |
| t | 894 | 5.6% |
| e | 541 | 3.4% |
| d | 541 | 3.4% |
| a | 486 | 3.1% |
| Other values (18) | 2037 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7720 | |
| Lowercase Letter | 3292 | |
| Decimal Number | 2963 | 18.6% |
| Dash Punctuation | 1525 | 9.6% |
| Space Separator | 417 | 2.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2515 | |
| G | 2283 | |
| P | 2170 | |
| N | 424 | 5.5% |
| A | 74 | 1.0% |
| T | 74 | 1.0% |
| V | 74 | 1.0% |
| U | 60 | 0.8% |
| M | 24 | 0.3% |
| X | 13 | 0.2% |
| Other values (2) | 9 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 894 | |
| e | 541 | |
| d | 541 | |
| a | 486 | |
| o | 472 | |
| r | 115 | 3.5% |
| p | 110 | 3.3% |
| n | 60 | 1.8% |
| v | 55 | 1.7% |
| s | 18 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1481 | |
| 3 | 1444 | |
| 4 | 30 | 1.0% |
| 7 | 8 | 0.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1525 |
Space Separator
| Value | Count | Frequency (%) |
| 417 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11012 | |
| Common | 4905 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 2515 | |
| G | 2283 | |
| P | 2170 | |
| t | 894 | 8.1% |
| e | 541 | 4.9% |
| d | 541 | 4.9% |
| a | 486 | 4.4% |
| o | 472 | 4.3% |
| N | 424 | 3.9% |
| r | 115 | 1.0% |
| Other values (12) | 571 | 5.2% |
Common
| Value | Count | Frequency (%) |
| - | 1525 | |
| 1 | 1481 | |
| 3 | 1444 | |
| 417 | 8.5% | |
| 4 | 30 | 0.6% |
| 7 | 8 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15917 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 2515 | |
| G | 2283 | |
| P | 2170 | |
| - | 1525 | |
| 1 | 1481 | |
| 3 | 1444 | |
| t | 894 | 5.6% |
| e | 541 | 3.4% |
| d | 541 | 3.4% |
| a | 486 | 3.1% |
| Other values (18) | 2037 |
budget
Real number (ℝ)
SKEWED 
| Distinct | 439 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37823658 |
| Minimum | 218 |
|---|---|
| Maximum | 1.22155 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 585500 |
| Q1 | 7000000 |
| median | 20000000 |
| Q3 | 40000000 |
| 95-th percentile | 1.2045 × 108 |
| Maximum | 1.22155 × 1010 |
| Range | 1.22155 × 1010 |
| Interquartile range (IQR) | 33000000 |
Descriptive statistics
| Standard deviation | 1.9671219 × 108 |
|---|---|
| Coefficient of variation (CV) | 5.2007712 |
| Kurtosis | 2991.8614 |
| Mean | 37823658 |
| Median Absolute Deviation (MAD) | 15000000 |
| Skewness | 50.469926 |
| Sum | 1.8904264 × 1011 |
| Variance | 3.8695686 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20000000 | 658 | 13.2% |
| 15000000 | 141 | 2.8% |
| 30000000 | 140 | 2.8% |
| 25000000 | 140 | 2.8% |
| 10000000 | 135 | 2.7% |
| 40000000 | 130 | 2.6% |
| 35000000 | 119 | 2.4% |
| 5000000 | 110 | 2.2% |
| 50000000 | 101 | 2.0% |
| 60000000 | 92 | 1.8% |
| Other values (429) | 3232 |
| Value | Count | Frequency (%) |
| 218 | 1 | < 0.1% |
| 1100 | 1 | < 0.1% |
| 1400 | 1 | < 0.1% |
| 3250 | 1 | < 0.1% |
| 4500 | 1 | < 0.1% |
| 7000 | 3 | |
| 9000 | 1 | < 0.1% |
| 10000 | 3 | |
| 13000 | 1 | < 0.1% |
| 14000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.22155 × 1010 | 1 | |
| 4200000000 | 1 | |
| 2500000000 | 1 | |
| 2400000000 | 1 | |
| 2127519898 | 1 | |
| 1100000000 | 1 | |
| 1000000000 | 1 | |
| 700000000 | 2 | |
| 600000000 | 1 | |
| 553632000 | 1 |
title_year
Real number (ℝ)
| Distinct | 91 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.523 |
| Minimum | 1916 |
|---|---|
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1916 |
|---|---|
| 5-th percentile | 1979 |
| Q1 | 1999 |
| median | 2005 |
| Q3 | 2011 |
| 95-th percentile | 2015 |
| Maximum | 2016 |
| Range | 100 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 12.346385 |
|---|---|
| Coefficient of variation (CV) | 0.006165415 |
| Kurtosis | 7.7507055 |
| Mean | 2002.523 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -2.3375399 |
| Sum | 10008610 |
| Variance | 152.43323 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2005 | 328 | 6.6% |
| 2009 | 258 | 5.2% |
| 2014 | 248 | 5.0% |
| 2006 | 238 | 4.8% |
| 2013 | 236 | 4.7% |
| 2010 | 229 | 4.6% |
| 2008 | 225 | 4.5% |
| 2011 | 225 | 4.5% |
| 2015 | 222 | 4.4% |
| 2012 | 218 | 4.4% |
| Other values (81) | 2571 |
| Value | Count | Frequency (%) |
| 1916 | 1 | |
| 1920 | 1 | |
| 1925 | 1 | |
| 1927 | 1 | |
| 1929 | 2 | |
| 1930 | 1 | |
| 1932 | 1 | |
| 1933 | 2 | |
| 1934 | 1 | |
| 1935 | 1 |
| Value | Count | Frequency (%) |
| 2016 | 103 | 2.1% |
| 2015 | 222 | |
| 2014 | 248 | |
| 2013 | 236 | |
| 2012 | 218 | |
| 2011 | 225 | |
| 2010 | 229 | |
| 2009 | 258 | |
| 2008 | 225 | |
| 2007 | 202 |
actor_2_facebook_likes
Real number (ℝ)
ZEROS 
| Distinct | 917 |
|---|---|
| Distinct (%) | 18.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1640.2729 |
| Minimum | 0 |
|---|---|
| Maximum | 137000 |
| Zeros | 55 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 25.85 |
| Q1 | 281 |
| median | 595 |
| Q3 | 912.75 |
| 95-th percentile | 11000 |
| Maximum | 137000 |
| Range | 137000 |
| Interquartile range (IQR) | 631.75 |
Descriptive statistics
| Standard deviation | 4026.0325 |
|---|---|
| Coefficient of variation (CV) | 2.4544894 |
| Kurtosis | 262.6445 |
| Mean | 1640.2729 |
| Median Absolute Deviation (MAD) | 317 |
| Skewness | 10.024356 |
| Sum | 8198084 |
| Variance | 16208938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 301 | 6.0% |
| 11000 | 110 | 2.2% |
| 2000 | 100 | 2.0% |
| 3000 | 75 | 1.5% |
| 0 | 55 | 1.1% |
| 10000 | 46 | 0.9% |
| 14000 | 40 | 0.8% |
| 13000 | 40 | 0.8% |
| 826 | 37 | 0.7% |
| 4000 | 34 | 0.7% |
| Other values (907) | 4160 |
| Value | Count | Frequency (%) |
| 0 | 55 | |
| 2 | 14 | 0.3% |
| 3 | 14 | 0.3% |
| 4 | 11 | 0.2% |
| 5 | 10 | 0.2% |
| 6 | 7 | 0.1% |
| 7 | 4 | 0.1% |
| 8 | 9 | 0.2% |
| 9 | 13 | 0.3% |
| 10 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 137000 | 1 | < 0.1% |
| 29000 | 1 | < 0.1% |
| 27000 | 2 | < 0.1% |
| 25000 | 3 | 0.1% |
| 23000 | 6 | |
| 22000 | 11 | |
| 21000 | 3 | 0.1% |
| 20000 | 6 | |
| 19000 | 7 | |
| 18000 | 9 |
imdb_score
Real number (ℝ)
| Distinct | 78 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4410564 |
| Minimum | 1.6 |
|---|---|
| Maximum | 9.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1.6 |
|---|---|
| 5-th percentile | 4.4 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8.015 |
| Maximum | 9.5 |
| Range | 7.9 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.1241073 |
|---|---|
| Coefficient of variation (CV) | 0.17452219 |
| Kurtosis | 0.94155654 |
| Mean | 6.4410564 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.74046501 |
| Sum | 32192.4 |
| Variance | 1.2636172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.7 | 221 | 4.4% |
| 6.6 | 200 | 4.0% |
| 7.2 | 193 | 3.9% |
| 6.4 | 185 | 3.7% |
| 6.5 | 184 | 3.7% |
| 7.3 | 184 | 3.7% |
| 7 | 181 | 3.6% |
| 7.1 | 181 | 3.6% |
| 6.8 | 180 | 3.6% |
| 6.1 | 178 | 3.6% |
| Other values (68) | 3111 |
| Value | Count | Frequency (%) |
| 1.6 | 1 | < 0.1% |
| 1.7 | 1 | < 0.1% |
| 1.9 | 3 | |
| 2 | 2 | |
| 2.1 | 3 | |
| 2.2 | 3 | |
| 2.3 | 3 | |
| 2.4 | 2 | |
| 2.5 | 2 | |
| 2.6 | 2 |
| Value | Count | Frequency (%) |
| 9.5 | 1 | < 0.1% |
| 9.3 | 1 | < 0.1% |
| 9.2 | 1 | < 0.1% |
| 9.1 | 3 | 0.1% |
| 9 | 3 | 0.1% |
| 8.9 | 5 | 0.1% |
| 8.8 | 7 | 0.1% |
| 8.7 | 13 | |
| 8.6 | 15 | |
| 8.5 | 24 |
aspect_ratio
Real number (ℝ)
| Distinct | 22 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2298299 |
| Minimum | 1.18 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.78 |
| Q1 | 1.85 |
| median | 2.35 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 1.3452716 |
|---|---|
| Coefficient of variation (CV) | 0.60330681 |
| Kurtosis | 95.99054 |
| Mean | 2.2298299 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.645703 |
| Sum | 11144.69 |
| Variance | 1.8097556 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.35 | 2664 | |
| 1.85 | 1890 | |
| 1.78 | 108 | 2.2% |
| 1.37 | 100 | 2.0% |
| 1.33 | 67 | 1.3% |
| 1.66 | 64 | 1.3% |
| 16 | 45 | 0.9% |
| 2.39 | 15 | 0.3% |
| 2.2 | 14 | 0.3% |
| 4 | 7 | 0.1% |
| Other values (12) | 24 | 0.5% |
| Value | Count | Frequency (%) |
| 1.18 | 1 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.33 | 67 | |
| 1.37 | 100 | |
| 1.44 | 1 | < 0.1% |
| 1.5 | 2 | < 0.1% |
| 1.66 | 64 | |
| 1.75 | 3 | 0.1% |
| 1.77 | 1 | < 0.1% |
| 1.78 | 108 |
| Value | Count | Frequency (%) |
| 16 | 45 | 0.9% |
| 4 | 7 | 0.1% |
| 2.76 | 3 | 0.1% |
| 2.55 | 2 | < 0.1% |
| 2.4 | 3 | 0.1% |
| 2.39 | 15 | 0.3% |
| 2.35 | 2664 | |
| 2.24 | 1 | < 0.1% |
| 2.2 | 14 | 0.3% |
| 2 | 5 | 0.1% |
movie_facebook_likes
Real number (ℝ)
ZEROS 
| Distinct | 876 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7487.4302 |
| Minimum | 0 |
|---|---|
| Maximum | 349000 |
| Zeros | 2162 |
| Zeros (%) | 43.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 162.5 |
| Q3 | 3000 |
| 95-th percentile | 40000 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 3000 |
Descriptive statistics
| Standard deviation | 19290.727 |
|---|---|
| Coefficient of variation (CV) | 2.5764149 |
| Kurtosis | 41.774062 |
| Mean | 7487.4302 |
| Median Absolute Deviation (MAD) | 162.5 |
| Skewness | 5.083321 |
| Sum | 37422176 |
| Variance | 3.7213213 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2162 | |
| 1000 | 109 | 2.2% |
| 11000 | 82 | 1.6% |
| 10000 | 81 | 1.6% |
| 12000 | 61 | 1.2% |
| 13000 | 58 | 1.2% |
| 2000 | 56 | 1.1% |
| 15000 | 52 | 1.0% |
| 14000 | 49 | 1.0% |
| 16000 | 47 | 0.9% |
| Other values (866) | 2241 |
| Value | Count | Frequency (%) |
| 0 | 2162 | |
| 2 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 5 | 0.1% |
| 5 | 2 | < 0.1% |
| 7 | 3 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 3 | 0.1% |
| 10 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 349000 | 1 | |
| 199000 | 1 | |
| 197000 | 1 | |
| 191000 | 1 | |
| 190000 | 1 | |
| 175000 | 1 | |
| 166000 | 1 | |
| 165000 | 1 | |
| 164000 | 1 | |
| 153000 | 1 |
| color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Color | James Cameron | 723 | 178 | 0 | 855 | Joel David Moore | 1000 | 760505847 | Action|Adventure|Fantasy|Sci-Fi | CCH Pounder | Avatar | 886204 | 4834 | Wes Studi | avatar|future|marine|native|paraplegic | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 | 3054 | English | USA | PG-13 | 237000000 | 2009 | 936 | 7.9 | 1.78 | 33000 |
| 1 | Color | Gore Verbinski | 302 | 169 | 563 | 1000 | Orlando Bloom | 40000 | 309404152 | Action|Adventure|Fantasy | Johnny Depp | Pirates of the Caribbean: At World's End | 471220 | 48350 | Jack Davenport | goddess|marriage ceremony|marriage proposal|pirate|singapore | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 | 1238 | English | USA | PG-13 | 300000000 | 2007 | 5000 | 7.1 | 2.35 | 0 |
| 2 | Color | Sam Mendes | 602 | 148 | 0 | 161 | Rory Kinnear | 11000 | 200074175 | Action|Adventure|Thriller | Christoph Waltz | Spectre | 275868 | 11700 | Stephanie Sigman | bomb|espionage|sequel|spy|terrorist | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 | 994 | English | UK | PG-13 | 245000000 | 2015 | 393 | 6.8 | 2.35 | 85000 |
| 3 | Color | Christopher Nolan | 813 | 164 | 22000 | 23000 | Christian Bale | 27000 | 448130642 | Action|Thriller | Tom Hardy | The Dark Knight Rises | 1144337 | 106759 | Joseph Gordon-Levitt | deception|imprisonment|lawlessness|police officer|terrorist plot | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 | 2701 | English | USA | PG-13 | 250000000 | 2012 | 23000 | 8.5 | 2.35 | 164000 |
| 4 | Color | Doug Walker | 110 | 103 | 131 | 369 | Rob Walker | 131 | 25445749 | Documentary | Doug Walker | Star Wars: Episode VII - The Force Awakens | 8 | 143 | unknown | none | http://www.imdb.com/title/tt5289954/?ref_=fn_tt_tt_1 | 156 | English | USA | Not Rated | 20000000 | 2005 | 12 | 7.1 | 2.35 | 0 |
| 5 | Color | Andrew Stanton | 462 | 132 | 475 | 530 | Samantha Morton | 640 | 73058679 | Action|Adventure|Sci-Fi | Daryl Sabara | John Carter | 212204 | 1873 | Polly Walker | alien|american civil war|male nipple|mars|princess | http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1 | 738 | English | USA | PG-13 | 263700000 | 2012 | 632 | 6.6 | 2.35 | 24000 |
| 6 | Color | Sam Raimi | 392 | 156 | 0 | 4000 | James Franco | 24000 | 336530303 | Action|Adventure|Romance | J.K. Simmons | Spider-Man 3 | 383056 | 46055 | Kirsten Dunst | sandman|spider man|symbiote|venom|villain | http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1 | 1902 | English | USA | PG-13 | 258000000 | 2007 | 11000 | 6.2 | 2.35 | 0 |
| 7 | Color | Nathan Greno | 324 | 100 | 15 | 284 | Donna Murphy | 799 | 200807262 | Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance | Brad Garrett | Tangled | 294810 | 2036 | M.C. Gainey | 17th century|based on fairy tale|disney|flower|tower | http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1 | 387 | English | USA | PG | 260000000 | 2010 | 553 | 7.8 | 1.85 | 29000 |
| 8 | Color | Joss Whedon | 635 | 141 | 0 | 19000 | Robert Downey Jr. | 26000 | 458991599 | Action|Adventure|Sci-Fi | Chris Hemsworth | Avengers: Age of Ultron | 462669 | 92000 | Scarlett Johansson | artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero | http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1 | 1117 | English | USA | PG-13 | 250000000 | 2015 | 21000 | 7.5 | 2.35 | 118000 |
| 9 | Color | David Yates | 375 | 153 | 282 | 10000 | Daniel Radcliffe | 25000 | 301956980 | Adventure|Family|Fantasy|Mystery | Alan Rickman | Harry Potter and the Half-Blood Prince | 321795 | 58753 | Rupert Grint | blood|book|love|potion|professor | http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1 | 973 | English | UK | PG | 250000000 | 2009 | 11000 | 7.5 | 2.35 | 10000 |
| color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5033 | Color | Shane Carruth | 143 | 77 | 291 | 8 | David Sullivan | 291 | 424760 | Drama|Sci-Fi|Thriller | Shane Carruth | Primer | 72639 | 368 | Casey Gooden | changing the future|independent film|invention|nonlinear timeline|time travel | http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1 | 371 | English | USA | PG-13 | 7000 | 2004 | 45 | 7.0 | 1.85 | 19000 |
| 5034 | Color | Neill Dela Llana | 35 | 80 | 0 | 0 | Edgar Tancangco | 0 | 70071 | Thriller | Ian Gamazon | Cavite | 589 | 0 | Quynn Ton | jihad|mindanao|philippines|security guard|squatter | http://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_1 | 35 | English | Philippines | Not Rated | 7000 | 2005 | 0 | 6.3 | 2.35 | 74 |
| 5035 | Color | Robert Rodriguez | 56 | 81 | 0 | 6 | Peter Marquardt | 121 | 2040920 | Action|Crime|Drama|Romance|Thriller | Carlos Gallardo | El Mariachi | 52055 | 147 | Consuelo Gómez | assassin|death|guitar|gun|mariachi | http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1 | 130 | Spanish | USA | R | 7000 | 1992 | 20 | 6.9 | 1.37 | 0 |
| 5036 | Color | Anthony Vallone | 110 | 84 | 2 | 2 | John Considine | 45 | 25445749 | Crime|Drama | Richard Jewell | The Mongol King | 36 | 93 | Sara Stepnicka | jewell|mongol|nostradamus|stepnicka|vallone | http://www.imdb.com/title/tt0430371/?ref_=fn_tt_tt_1 | 1 | English | USA | PG-13 | 3250 | 2005 | 44 | 7.8 | 2.35 | 4 |
| 5037 | Color | Edward Burns | 14 | 95 | 0 | 133 | Caitlin FitzGerald | 296 | 4584 | Comedy|Drama | Kerry Bishé | Newlyweds | 1338 | 690 | Daniella Pineda | written and directed by cast member | http://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_1 | 14 | English | USA | Not Rated | 9000 | 2011 | 205 | 6.4 | 2.35 | 413 |
| 5038 | Color | Scott Smith | 1 | 87 | 2 | 318 | Daphne Zuniga | 637 | 25445749 | Comedy|Drama | Eric Mabius | Signed Sealed Delivered | 629 | 2283 | Crystal Lowe | fraud|postal worker|prison|theft|trial | http://www.imdb.com/title/tt3000844/?ref_=fn_tt_tt_1 | 6 | English | Canada | Not Rated | 20000000 | 2013 | 470 | 7.7 | 2.35 | 84 |
| 5039 | Color | unknown | 43 | 43 | 49 | 319 | Valorie Curry | 841 | 25445749 | Crime|Drama|Mystery|Thriller | Natalie Zea | The Following | 73839 | 1753 | Sam Underwood | cult|fbi|hideout|prison escape|serial killer | http://www.imdb.com/title/tt2071645/?ref_=fn_tt_tt_1 | 359 | English | USA | TV-14 | 20000000 | 2005 | 593 | 7.5 | 16.00 | 32000 |
| 5040 | Color | Benjamin Roberds | 13 | 76 | 0 | 0 | Maxwell Moody | 0 | 25445749 | Drama|Horror|Thriller | Eva Boehnke | A Plague So Pleasant | 38 | 0 | David Chandler | none | http://www.imdb.com/title/tt2107644/?ref_=fn_tt_tt_1 | 3 | English | USA | Not Rated | 1400 | 2013 | 0 | 6.3 | 2.35 | 16 |
| 5041 | Color | Daniel Hsia | 14 | 100 | 0 | 489 | Daniel Henney | 946 | 10443 | Comedy|Drama|Romance | Alan Ruck | Shanghai Calling | 1255 | 2386 | Eliza Coupe | none | http://www.imdb.com/title/tt2070597/?ref_=fn_tt_tt_1 | 9 | English | USA | PG-13 | 20000000 | 2012 | 719 | 6.3 | 2.35 | 660 |
| 5042 | Color | Jon Gunn | 43 | 90 | 16 | 16 | Brian Herzlinger | 86 | 85222 | Documentary | John August | My Date with Drew | 4285 | 163 | Jon Gunn | actress name in title|crush|date|four word title|video camera | http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1 | 84 | English | USA | PG | 1100 | 2004 | 23 | 6.6 | 1.85 | 456 |